Leveraging efficient indexing schema to support multigraph query answering
نویسندگان
چکیده
Many real world datasets can be represented by graphs with a set of nodes interconnected with each other by multiple relations (e.g., social network, RDF graph, biological data). Such a rich graph, called multigraph, is well suited to represent real world scenarios with complex interactions. However, performing subgraph query on multigraphs is still an open issue since, unfortunately, all the existing algorithms for subgraph query matching are not able to adequately leverage the multiple relationships that exist between the nodes. Motivated by the lack of approaches for sub-multigraph query and stimulated by the increasing number of datasets that can be modelled as multigraphs, in this paper we propose IMQA (Index based Multigraph Query Answering), a novel algorithm to extract all the embeddings of a sub-multigraph query from a single large multigraph. IMQA is composed of two main phases: Firstly, it implements a novel indexing schema for multiple edges, which will help to efficiently retrieve the vertices of the multigraph that match the query vertices. Secondly, it performs an efficient subgraph search to output the entire set of embeddings for the given query. Extensive experiments conducted on real datasets prove the time efficiency as well as the scalability of IMQA. RÉSUMÉ. De nombreuses données réelles peuvent être représentées par un réseau avec un ensemble de nœuds interconnectés via différentes relations (i.e. les réseaux sociaux, les données biologiques, les graphes RDF). Ce type de graphe, appelé multigraphe, est tout à fait adapté à la représentation de scénarios réels contenant des interactions complexes. La recherche de sousmultigraphe dans des multigraphes est un domaine de recherche ouvert et malheureusement les algorithmes existants pour faire de la recherche de sous-graphe ne sont pas adaptés et ne peuvent pas prendre en compte les différentes relations qui peuvent exister entre les nœuds. Motivés par le manque d’approches existantes et par le nombre croissant d’applications qui peuvent être modélisées via des multigraphes, nous proposons dans cet article IMQA un nouvel algorithme pour extraire tous les sous-multigraphes inclus dans un grand multigraphe. IMQA comporte deux étapes principales. Tout d’abord il implémente une nouvelle structure d’indexation pour les relations multiples qui est utilisée pour rechercher efficacement les sommets du multigraphe qui correspondent aux sommets de la requête. Ensuite, il réalise une recherche efficace de Ingénierie des systèmes d’information – n 3/2016, 53-74 54 ISI. Volume 21 – n 3/2016 l’ensemble des sous-multigraphes correspondant à une requête donnée. Les nombreuses expérimentations menées sur des jeux de données réelles ont montré l’efficacité et le passage à l’échelle de IMQA.
منابع مشابه
Ja n 20 06 Efficient Query Answering over Conceptual Schemas of Relational Databases
We develope a query answering system, where at the core of the work there is an idea of query answering by rewriting. For this purpose we extend the DL DL-Lite [5] with the ability to support n-ary relations , obtaining the DL DLR-Lite, which is still polynomial in the size of the data [3,4]. We devise a flexible way of mapping the conceptual level to the relational level, which provides the us...
متن کاملEfficient Query Answering over Conceptual Schemas of Relational Databases : Technical Report
We develope a query answering system, where at the core of the work there is an idea of query answering by rewriting. For this purpose we extend the DL DL-Lite [5] with the ability to support n-ary relations , obtaining the DL DLR-Lite, which is still polynomial in the size of the data [3,4]. We devise a flexible way of mapping the conceptual level to the relational level, which provides the us...
متن کاملAnswering Queries Using Views: A Survey
The problem of answering queries using views is to find efficient methods of answering a query using a set of previously materialized views over the database, rather than accessing the database relations. The problem has recently received significant attention because of its relevance to a wide variety of data management problems. In query optimization, finding a rewriting of a query using a se...
متن کاملQuerying Transaction-Time Databases under Branched Schema Evolution
Transaction-time databases have been proposed for storing and querying the history of a database. While past work concentrated on managing the data evolution assuming a static schema, recent research has considered data changes under a linearly evolving schema. An ordered sequence of schema versions is maintained and the database can restore/query its data under the appropriate past schema. The...
متن کاملA Scalable and Efficient Query Answering for a Context and Schema Mediation
This paper presents a generic, scalable and efficient query answering for semantic interoperable information systems. We propose a generic multi agent architecture that supports our approach. Moreover, we define a semantic proximity allowing to separate the semantic interoperable information systems in segments allowing encircled efforts of query answering. Another advantage of our approach con...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Ingénierie des Systèmes d'Information
دوره 21 شماره
صفحات -
تاریخ انتشار 2016